Lip Movements Generation at a Glance

نویسندگان

  • Lele Chen
  • Zhiheng Li
  • Ross K. Maddox
  • Zhiyao Duan
  • Chenliang Xu
چکیده

Cross-modality generation is an emerging topic that aims to synthesize data in one modality based on information in a different modality. In this paper, we consider a task of such: given an arbitrary audio speech and one lip image of arbitrary target identity, generate synthesized lip movements of the target identity saying the speech. To perform well in this task, it inevitably requires a model to not only consider the retention of target identity, photo-realistic of synthesized images, consistency and smoothness of lip images in a sequence, but more importantly, learn the correlations between audio speech and lip movements. To solve the collective problems, we explore the best modeling of the audio-visual correlations in building and training a lip-movement generator network. Specifically, we devise a method to fuse audio and image embeddings to generate multiple lip images at once and propose a novel correlation loss to synchronize lip changes and speech changes. Our final model utilizes a combination of four losses for a comprehensive consideration in generating lip movements; it is trained in an end-to-end fashion and is robust to lip shapes, view angles and different facial characteristics. Thoughtful experiments on three datasets ranging from lab-recorded to lips in-thewild show that our model significantly outperforms other state-of-the-art methods extended to this task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Shape selectivity and remapping in dorsal stream visual area LIP.

We explore the visual world by making rapid eye movements (saccades) to focus on objects and locations of interest. Despite abrupt retinal image shifts, we see the world as stable. Remapping contributes to visual stability by updating the internal image with every saccade. Neurons in macaque lateral intraparietal cortex (LIP) and other brain areas update information about salient locations arou...

متن کامل

Functional outcomes of cleft lip surgery. Part II: Quantification of nasolabial movement.

OBJECTIVE To explore nasolabial movements in participants with repaired cleft lip and palate. DESIGN A parallel, three-group, nonrandomized clinical trial. SUBJECTS Group 1=31 participants with a cleft lip slated for revision surgery (revision), group 2=32 participants with a cleft lip who did not have surgery (nonrevision), and group 3=37 noncleft control participants. METHODS Three-dime...

متن کامل

Complications of Bilateral Sagittal Split Osteotomy in Patients with Mandibular Prognathism

Introduction: Bilateral sagittal split osteotomy (BSSO) of mandible is vastly used in treatment of mandibular deficiencies and discrepancies. Since this method could affect esthetic as well as function, evaluating these effects from various aspects is crucial. This study assessed the effects of this technique on the function of masseter muscle, jaw movements, and sensory changes along with fail...

متن کامل

A cerebral central pattern generator in Aplysia and its connections with buccal feeding circuitry.

Different feeding-related behaviors in Aplysia require substantial variations in the coordination of movements of two separate body parts, the lips and buccal mass. The central pattern generators (CPGs) and motoneurons that control buccal mass movements reside largely in the buccal ganglion. It was previously thought that control of the cerebral neuronal circuitry and motoneurons that generate ...

متن کامل

Activity in V4 reflects the direction, but not the latency, of saccades during visual search.

We constantly make eye movements to bring objects of interest onto the fovea for more detailed processing. Activity in area V4, a prestriate visual area, is enhanced at the location corresponding to the target of an eye movement. However, the precise role of activity in V4 in relation to these saccades and the modulation of other cortical areas in the oculomotor system remains unknown. V4 could...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018